Extended maximum a posterior linear regression (EMAPLR) model adaptation for speech recognition

نویسندگان

  • Wu Chou
  • Olivier Siohan
  • Tor André Myrvoll
  • Chin-Hui Lee
چکیده

In this paper, a new approach for model adaptation, extended maximum a posterior linear regression (EMAPLR), is described and studied. EMAPLR is an extension of maximum a posterior linear regression (MAPLR) for transform based model adaptation. The proposed approach has a close form solution under the elliptic symmetric matrix variate priors, and it is effective in our speech recognition experiments. EMAPLR is based on a direct MAPLR solution of the transform imageW s without explicitly solving the transformation matrix W . This is fundamentally different from conventional MAPLR and MLLR. Moreover, the proposed EMAPLR approach is incorporated with the structured prior evolution which significantly improves the algorithm efficiency and robustness. The structure of prior evolution in MAPLR is studied and it is shown that under the structured prior evolution, the priors in MAPLR follows a recursive formulation. Experimental results on WSJ (Spoke 3) non-native speaker adaptation task indicates that significant gain over MLLR and MAPLR can be obtained with same amount of adaptation data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Quasi-Bayes linear regression for sequential learning of hidden Markov models

This paper presents an online/sequential linear regression adaptation framework for hidden Markov model (HMM) based speech recognition. Our attempt is to sequentially improve speaker-independent speech recognition system to handle the nonstationary environments via the linear regression adaptation of HMMs. A quasi-Bayes linear regression (QBLR) algorithm is developed to execute the sequential a...

متن کامل

Online speaker adaptation based on quasi-Bayes linear regression

This paper presents an online/sequential linear regression adaptation framework for hidden Markov model (HMM) based speech recognition. Our attempt is to sequentially improve speaker-independent (SI) speech recognizer to meet nonstationary environments via linear regression adaptation of SI HMM’s. A quasi-Bayes linear regression (QBLR) algorithm is developed to execute online adaptation where t...

متن کامل

Discriminative adaptation for log-linear acoustic models

Log-linear models have recently been used in acoustic modeling for speech recognition systems. This has been motivated by competitive results compared to systems based on Gaussian models, and a more direct parametrisation of the posterior model. To competitively use log-linear models for speech recognition, important methods, such as speaker adaptation, have to be reformulated in a log-linear f...

متن کامل

Maximum a Posterior Linear Regression Based Variance Adaptation of Continuous Density Hmms

In this paper, the theoretical framework of maximum a posterior linear regression (MAPLR) based variance adaptation for continuous density HMMs is described. In our approach, a class of informative prior distribution for MAPLR based variance adaptation is identified, from which the close form solution of MAPLR based variance adaptation is obtained under its EM formulation. Effects of the propos...

متن کامل

Maximum Likelihood Linear Regression (MLLR) for ASR Severity Based Adaptation to Help Dysarthric Speakers

Automatic speech recognition (ASR) for dysarthric speakers is one of the most challenging research areas. The lack of corpus for dysarthric speakers makes it even more difficult. The speaker adaptation (SA) is an alternative solution to overcome the lack of dysarthric speech and enhance the performance of ASR. This paper introduces the Severity-based adaptation, using small amount of speech dat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000